Connected Digits Recognition Task: ISTC–CNR Comparison of Open Source Tools

نویسندگان

  • Piero Cosi
  • Mauro Nicolao
چکیده

EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. In this work, the results of three open source ASR toolkits will be described. CSLU Speech Tools, CSLR SONIC, CMU SPHINX are applied on the EVALITA clean and noisy digits recognition task and this report will describe the complete evaluation methodology. CSLR SONIC has resulted to have the best performances in all the tasks and even with high specialized trainings. We think that it is mostly because of the PMVDR features used in this system. CMU SPHINX has been the easiest system to train and test and its general performances are only slightly lower than SONIC. CSLU Speech Tools is the most specialized recognition system on digit and its score stands in the middle of the others. Overall, the three systems have Word Accuracy score over 90%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evalita-istc Comparison of Open Source Tools on Clean and Noisy Digits Recognition Tasks

1. ABSTRACT EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. The general objective of EVALITA is to promote the development of language and speech technologies for the Italian language, providing a shared framework where different systems and approaches can be evaluated in a consistent manner. In this work the results of the e...

متن کامل

AKIRA: a Framework for MABS

Here we present AKIRA, a framework for Agent-based cognitive and social simulations. AKIRA is an open-source project, currently developed mainly at ISTC-CNR, that exploits state-of-the-art techniques and tools. It gives to the programmer a number of facilities for building Agents at different level of complexity (e.g. reactive, deliberative, layered). Here we describe the main architectural fea...

متن کامل

Designing and Implementing MABS in AKIRA

Here we present AKIRA, a framework for Agent-based cognitive and social simulations. AKIRA is an open-source project, currently developed mainly at ISTC-CNR, that exploits state-of-the-art techniques and tools. It gives to the programmer a number of facilities for building Agents at different levels of complexity (e.g. reactive, deliberative, layered). Here we describe the main architectural fe...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

A Facial Animation Framework with Emotive/expressive Capabilities

LUCIA is an MPEG-4 facial animation system developed at ISTC-CNR.. It works on standard Facial Animation Parameters and speaks with the Italian version of FESTIVAL TTS. To achieve an emotive/expressive talking head LUCIA was build from real human data physically extracted by ELITE optotracking movement analyzer. LUCIA can copy a real human by reproducing the movements of passive markers positio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009